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METHOD FOR COMPRESSING HIGH REPETITIVITY DATA, IN PARTICULAR 
DATA USED IN MEMORY DEVICE TESTING 

Priority Claim 

5 

[1] This application claims priority from European patent application 
No. 02425660.4, filed October 30, 2002, which is incorporated herein by reference. 

Technical Field 

10 

[2] The present invention relates to a method for compressing high repetitivity 
data. In particular, the present invention finds advantageous, but non-exclusive, 
application in the compression of data used in memory device testing, to which the 
ensuing treatment will make explicit reference without, however, this implying any 
15 loss in generality. 

Background 

[3] As is known, a memory device testing basically involves execution of two 
20 types of operations: verifying possible read fails, which, as is known, are caused by 
defects that are eliminated by resorting to redundancy elements purposely 
provided in the memory device design stage, and, in the event of read fails being 
detected, analysing such fails so as to perform a sort of screening of the defects 
that arise during the manufacture of the memory device. 

25 [4] Read fail analysis basically envisages executing three operations: storing in 
the memory device a bitmap having a regular pattern, for example, a bitmap 
containing all zeros or all ones, or else all zeros except for a diagonal of ones, 
alternating zeros and ones (checkerboard pattern), etc., then reading the bitmap 
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from the memory device, and, finally, analysing the deviation of the bitmap from the 
stored bitmap. 

[5] Read fail analysis, as well as verify of the possible presence of read fails, is 
performed using a tester connected on one side to the memory device to be tested 
5 and on the other side to a personal computer or a workstation. 

[6] FIG, 1 schematically illustrates by way of example the circuit arrangement 
for testing an embedded memory device, i.e., a memory device which, in use, is 
associated to an electronic device, typically a microcontroller or a dedicated logic, 
which sends to the memory device the addresses of the memory locations in which 
10 the data to be read are stored and acquires the data outputted by the latter. 

[7] In particular, in FIG. 1, the reference number 1 designates the embedded 
memory device; number 2 designates the memory to be tested forming part of the 
embedded memory device 1, which may be either of a nonvolatile type, for 
example a Flash memory, or of a volatile type, for example a static memory 
15 (SRAM) or a dynamic memory (DRAM); number 3 designates the microcontroller, 
which is associated to the memory 2 and also forms part of the embedded memory 
device 1\ number 4 designates the tester; and finally number 5 designates the 
personal computer. 

[8] Tester 4 carries out autonomously the entire operation of verifying the 
20 possible presence of read fails, whilst, during the subsequent operation of read fail 
analysis, basically carries out the functions of accumulating the compressed data 
and generating a corresponding file, which is then supplied to the personal 
computer or to the workstation for a subsequent processing, for example, display 
or, if need be, a specific processing. , 

25 [9] In detail, microprocessor 3 reads the state of the memory 2 and compresses 
the read data, and the compressed data are then stored in the internal memory of 
the microprocessor 3. When data compression is completed or the internal 
memory of the microprocessor 3 is full, a signal indicating this situation is 
generated by the microprocessor 3 and supplied to the tester 4, which reads from 

30 the internal memory of the microprocessor 3 the compressed data and saves them 
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in a file. At the end of the operations, the file thus generated is supplied to the 
personal computer or to the workstation 5. 

[10] The testers 4 currently available on the market differ, among other things, 
basically for the speed with which they are able to supply to the personal computer 
5 5 the data read from the memory device 2. In particular, amongst the testers 
currently available on the market, the ones which are operatively slower require 
even around 100 ms for supplying to the personal computer 5 even just one datum 
read, in so far as they use a serial communication protocol and have an extremely 
simple internal logic. 

10 [11] A very effective method that is currently used to increase the amount of data 
transmitted in unit time by the tester 4 to the personal computer 5 is represented by 
the implementation of the so-called data compression algorithms, the most widely 
used of which are essentially based upon the principle of sending to the personal 
computer just the bits or groups of bits for which a read fail has been verified and a 

15 corresponding address with respect to a reference bitmap. 

[12] The data compression algorithms based upon comparison of the bitmap 
read with a reference bitmap require first of all generation of the reference bitmap 
and their performances depend to a large extent upon the degree of "resemblance" 
of the bitmap read with the reference bitmap, and, in particular, their performances 
20 are satisfactory only in the case in which the bitmap read is not very different from 
the reference bitmap. 

Summary 

25 

[13] An aim of the present invention is therefore to provide a data compression 
method that is not based upon the criterion of comparison between bitmaps. 

[14] According to one aspect of the present invention, a method is provided for 
compressing high repetitivity data, as defined in Claim 1 . 

30 
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Brief Description of the Drawings 

5 [15] For a better understanding of the present invention, there is now described a 
preferred embodiment, purely by way of non-limiting example and with reference to 
the attached drawings, in which: 

[16] FIG. 1 schematically shows a circuit arrangement for testing an embedded 
memory device; 

10 [17] FIGS. 2a and 2b show flowcharts of a compression method according to 
one embodiment of the present invention; 

[18] FIGS. 3a and 3b show data formats used in encoding a sequence of 
repetitive data according to a first embodiment of the present invention; 

[19] FIGS. 4a, 4b, 4c, 4d and 4e show an example of the encoding of a 
15 sequence of repetitive data according to the first embodiment of the present 
invention; 

[20] FIGS. 5a to 5m show another example of the encoding of a sequence of 
repetitive data according to the first embodiment of the present invention; 

[21] FIGS. 6a, 6b, 6c and 6d show data formats used in encoding a sequence of 
20 repetitive data according to a second embodiment of the present invention; and 

[22] FIGS. 7a, 7b, 7c and 7d show an example of the encoding of a sequence of 
repetitive data according to the second embodiment of the present invention. 



25 

Detailed Description 

[23] The following discussion is presented to enable a person skilled in the art to 
make and use the invention. Various modifications to the embodiments will be 
30 readily apparent to those skilled in the art, and the generic principles herein may be 
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applied to other embodiments and applications without departing from the spirit 
and scope of the present invention. Thus, the present invention is not intended to 
be limited to the embodiments shown, but is to be accorded the widest scope 
consistent with the principles and features disclosed herein. 

5 [24] As is illustrated in FIG. 2a, a compression method according to an 
embodiment of the present invention essentially envisages execution of two 
operations: 

recognizing a sequence of repetitive data (block 10), whereby, as the data 
are read from the memory device under test, it is determined whether individual 
1 0 data or groups of data are repeated according to a sequence; and 

sequence encoding (block 20), whereby the recognized sequence of 
repetitive data or repetitive groups of data is appropriately encoded. 

[25] In particular, recognizing a sequence of repetitive data is carried out by 
means of a buffer having a given data storage capacity and, as is illustrated in 
15 FIG,2b, basically envisages execution of two operations: 

recognizing a repetition of an individual datum (block 30); and 
subsequently recognizing a repetition of groups of data (block 40). 

[26] As regards recognizing a repetition of an individual datum, each datum read 
from the memory device is stored temporarily in the buffer together with a counter 
20 indicating the number of times the datum is repeated. 

[27] Considering, for instance, 16-bit data, hereinafter represented for reasons of 
simplicity with the hexadecimal notation, if, for example, the data sequence AAA, 
BB, AAA, BB were read from the memory device, in the buffer there would be 
stored the datum "A" and a corresponding counter set at the value "3", then the 
25 datum "B" and a corresponding counter set at the value "2, then again the datum 
"A" and a corresponding counter set at the value "3", and finally again the datum 
"B" and a corresponding counter set at the value "2". 

[28] Based on the contents of the buffer, a possible repetition of groups of data is 
then recognized. In the example considered above, the presence of the repetition 
30 twice of the sequence consisting of the datum "A" that is repeated three times and 
of the datum "B" that is repeated twice is recognized. 
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[29] As regards repetition encoding, this can be carried out in two different ways, 
based on the format of the compressed data that is to be obtained, and, in 
particular, based on whether the compressed data have or not the same length as 
the data to be compressed. 

5 [30] In particular, FIGS. 3a and 3b show two data formats used in encoding a 
data sequence according to a first embodiment of the present invention, in which 
the length of the compressed data is greater than the length of the non- 
compressed data and, in particular, the compressed data are longer than the non- 
compressed data by one bit. For reasons of simplicity, FIGS. 3a and 3b show the 
10 encoding of 16-bit data, it being however clear that what has been said can be 
applied in a similar manner to data of any length. 

[31] In particular, the encoding of a data sequence according to the first 
embodiment of the present invention is carried out using in combination one or 
more words with a format for non-compressible data and one or more words with a 
1 5 format for compressible data. 

[32] FIG. 3a shows a word having a format for non-compressible data. According 
to what is shown in FIG. 3a, a word having a format for non-compressible data is 
made up of seventeen bits defined as follows: 

the most significant bit is set at the logic value "1"; and 
20 the remaining sixteen bits are the sixteen bits of the datum. 

[33] FIG. 3b illustrates, instead, a word having a format for compressible data. 
According to what is shown in FIG. 3b, a word having a format for compressible 
data is made up of seventeen bits defined as follows: 

the most significant bit is set at the logic value "0"; 
25 the next five most significant bits indicate the total number of subsequent 

words that, together with the present word, encode the sequence of repetitive data; 
and 

the remaining eleven bits indicate the number of times the words indicated 
by the five bits referred to above are repeated. 

30 [34] In order to understand the modality with which the various words that 
encode a data sequence are generated, reference can be made to FIGS. 4a-4 , 
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which show the encoding of a data sequence that is repeated twenty-one times 
and is formed by the datum "0", which is repeated seven times, followed by the 
datum "53", which is repeated six times. 

[35] The encoding of the above data sequence requires the generation of five 17- 
5 bit words formed in the way described in what follows. The procedure for 
determining the number of words necessary for encoding a sequence will emerge 
more clearly from the following description. 

[36] The first word, shown in FIG. 4a, is used for indicating the total number of 
words that are needed to encode the data sequence and the number of repetitions 
10 of the sequence, and, in particular, has the following structure: 

the most significant bit is set at the logic value "0" to indicate that the word 
belongs to the encoding of a compressed sequence; 

the next five most significant bits indicate the total number of words following 
the present one, in the case in point four, which encode the repetition of the data 
1 5 sequence referred to above; and 

the remaining eleven bits indicate the number of repetitions, in the case in 
point twenty-one, of the words following the present one and indicated by the five 
bits referred to above, in the case in point four, which encode the repetition of the 
sequence. 

20 [37] The second word and the third word, shown in FIGS. 4b and 4c, 
respectively, are used to encode the repetition of the datum "0". In particular, the 
second word indicates the number of words following it, which make up the 
encoding of the repetition of the datum "0" and the number of repetitions of said 
datum, and, in particular, has the following structure: 

25 the most significant bit is set at the logic value "0" to indicate that the word 

belongs to the encoding of a compressed sequence; 

the next five most significant bits indicate the number of words following the 
present one, in the case in point one, which encode the datum "0"; and 

the remaining eleven bits indicate the number of repetitions, in the case in 

30 point seven, of the words following the present one and the number of which is 
indicated by the five bits referred to above, in the case in point one, which encode 
the datum "0". 
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[38] The third word represents the datum "0", and, in particular, has the following 
structure: 

the most significant bit is set at the logic value "1" to indicate that the word 
belongs to the encoding of a non-compressed datum; and 
5 the next sixteen bits represent the datum "0". 

[39] The fourth word and the fifth word, shown in FIGS. 4d and 4e, respectively, 
are used for encoding the repetition of the datum "53". In particular, the fourth word 
indicates the number of words following it, which make up the encoding of the 
repetition of the datum "53" and the number of repetitions of said datum, and, in 

10 particular, has the following structure: 

the most significant bit is set at the logic value "0" to indicate that the word 
belongs to the encoding of a compressed datum; 

the next five most significant bits indicate the number of words following the 
present one, in the case in point one, which encode the datum "53"; and 

15 the remaining eleven bits indicate the number of repetitions, in the case in 

point six, of the words following the present one and the number of which is 
indicated by the five bits referred to above, in the case in point one, which encode 
the datum "53". 

[40] The fifth word represents the datum "53", and, in particular, has the following 
20 structure: 

the most significant bit is set at the logic value "1" to indicate that the word 
belongs to the encoding of a non-compressed datum; 

the remaining sixteen bits represent the datum "53". 

[41] In brief, the third word represents the datum "0", whilst the second word 
25 indicates the number of times the datum represented in the third word is repeated, 
the fifth word represents the datum "53", whilst the fourth word indicates the 
number of times the datum represented in the fifth word is repeated, and finally the 
first word indicates the number of times the four subsequent words are repeated. 

[42] As may be noted, therefore, the words that make up the encoding of the 
30 data sequence repetition taken as example are arranged according to a nested 
structure, in which: the third and fifth words constitute the innermost nestings; the 
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second and fourth words constitute the two intermediate nestings, which indicate 
the number of repetitions of the two innermost nestings, in particular, the first 
intermediate nesting being external to the first innermost nesting and indicating the 
number of repetitions of the latter, and the second intermediate nesting being 
5 external to the second innermost nesting and indicating the number of repetitions 
of the latter; finally, the first word constitutes the external nesting, which indicates 
the number of repetitions of the two intermediate nestings. 

[43] In particular, each innermost nesting is made up of a word representing a 
datum that is repeated, whereas the intermediate nesting external to this is made 
10 up of a word indicating, on the one hand, the number of subsequent words that 
make up the respective innermost nesting, generally a single word, and, on the 
other hand, the number of times the words that make up the respective innermost 
nesting are repeated. 

[44] In addition, whatever the nesting level, repetition of a nesting involves 
15 generation of a respective outer nesting formed by a word indicating, on the one 
hand, the number of subsequent words that make up the respective inner nesting 
and, on the other hand, the number of times the said words are repeated. 

[45] In order to further clarify what has been said above, another example is 
illustrated in FIGS. 5a-5m, which refer to the encoding of the following repetition of 

20 data sequence: {[A(5), B(6)](7), [C(3), D(4)](6)}(10), i.e., a data sequence that 
repeats in all ten times and is made up of two sub-sequences, in which the first 
sub-sequence is repeated seven times and is formed by the datum "A", which is 
repeated five times, and the datum "B", which is repeated six times, whilst the 
second sub-sequence is repeated six times and is formed by the datum "C", which 

25 is repeated three times, and the datum "D", which is repeated four times. 

[46] In accordance with the above description, to encode said repetition, in all 
eleven words are required, in which eight words are needed to encode the 
repetitions of the four data "A", "B", "C", and "D" (two words for encoding the 
repetition of each datum), one word is needed for encoding the repetition of the 
30 sub-sequence formed by the repetitions of "A" and "B", one word is needed for 
encoding the repetition of the sub-sequence formed by the repetitions of "C" and 
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"D", and, finally, one word is needed for encoding the repetition of the sequence 
formed by the repetitions of the aforesaid sub-sequences. 

[47] As may be noted, the first word (FIG. 5a) indicates that the encoding of the 
entire sequence is made up of ten words in addition to the first word (for a total of 
5 eleven words) and that said words are repeated ten times (number of repetitions of 
the sequence), the words from the second to the sixth (FIGS. 5b-5f) regard the 
encoding of the first sub-sequence, whilst the words from the seventh to the 
eleventh (FIGS. 5g-5m) regard the encoding of the second sub-sequence. 

[48] As regards the first sub-sequence, the second word (FIG. 5b) indicates that 
10 the encoding is made up of four words in addition to the second one (for a total of 
five words) and that said words are repeated seven times (number of repetitions of 
the first sub-sequence), the third word and the fourth word (FIGS. 5c and 5d) 
regard the encoding of the repetition of the datum "A", whilst the fifth word and the 
sixth word (FIGS. 5e and 5f) regard the encoding of the datum "B". In particular, 
15 the third word (FIG. 5c) indicates that the encoding of the datum "A" is made up of 
one word in addition to the third word (for a total of two words) and that the 
following word is repeated five times (number of repetitions of the datum "A"), 
whilst the fourth word (FIG. 5d) indicates the datum "A", just as the fifth word (FIG. 
5e) indicates that the encoding of the datum "B" is made up of one word in addition 
20 to the fifth word (for a total of two words) and that the following word is repeated six 
times (number of repetitions of the datum "B"), whilst the sixth word (FIG. 5f) 
indicates the datum "B" 

[49] As regards the second sub-sequence, the seventh word (FIG. 5g) indicates 
that the encoding is made up of four words in addition to the seventh (for a total of 

25 five words) and that said words are repeated six times (number of repetitions of the 
second sub-sequence), the eighth word and ninth word (FIGS. 5h and 5i) regard 
the encoding of the repetition of the datum "C", whilst the tenth word and the 
eleventh word (FIG. 51 and 5m) regard the encoding of the datum "D". In particular, 
the eighth word (FIG. 5h) indicates that the encoding of the datum "C" is made up 

30 of one word in addition to the eighth word (for a total of two words) and that the 
following word is repeated three times (number of repetitions of the datum "C"), 
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whilst the ninth word (FIG. 5i) indicates the datum "C", just as the tenth word (FIG. 
51) indicates that the encoding of the datum "D" is made up of one word in addition 
to the tenth word (for a total of two words) and that the following word is repeated 
four times (number of repetitions of the datum "D"), whilst the eleventh word (FIG. 
5 5m) indicates the datum "D". 

[50] FIGS. 6a, 6b, 6c and 6d illustrate, instead, the data formats used in the 
encoding of a data sequence according to a second embodiment of the present 
invention, in which the length of the compressed data is equal to the length of the 
non-compressed data. In a way similar to what has been said in regard to FIGS. 3a 
10 and 3b, FIGS. 6a to 6d illustrate the encoding of 16-bit data, it being however clear 
that what has been said can be applied in a similar manner to data of any length. 

[51] The encoding of a data sequence according to the second embodiment of 
the present invention differs from the encoding according to the first embodiment of 
the present invention in that a format for compressible data is used which is 
15 specific for encoding a sequence of zeros, i.e., a sequence of data the bits of which 
all assume a zero logic value, in so far as this is the most frequent datum in the 
memory device testing. 

[52] In particular, FIG. 6a shows a word having a data format specific for the 
encoding of a sequence of zeros. According to what is shown in FIG. 6a, the word 
20 having said specific data format is made up of sixteen bits defined as follows: 
the two most significant bits are set at the logic value "0"; and 
the remaining fourteen bits indicate the number of subsequent data, the 
value of which is "0". 

[53] FIG. 6b illustrates, instead, a word having a format for non-compressible 
25 data. According to what is shown in FIG. 6b, the word having a format for non- 
compressible data is made up of sixteen bits defined as follows: 
the most significant bit is set at the logic value "1"; and 
the remaining fifteen bits are the fifteen bits of the datum. 

[54] Furthermore, if the most significant bit of the datum coincides with the first 
30 bit of the encoded datum, i.e., "1", then a further word is generated as shown in 
FIG. 6c, which is defined as follows: 

n 
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the most significant bit is set at the logic value "1"; and 
the remaining fifteen bits are set at the logic value "0". 

[55] If, instead, the most significant bit of the non-compressible datum is "0", then 
no further word is generated. 

5 [56] Finally, FIG. 6d illustrates a word having a format for compressible data. 
According to what is shown in FIG. 6d, the word having a format for compressible 
data is made up of sixteen bits defined as follows: 

the most significant bit is set at the logic value "0", and the very next bit is set 
at the logic value "1"; 

10 the next four most significant bits indicate the total number of subsequent 

words, which, together with the first one, encode the repetition of the datum or of 
the sequence of data; and 

the remaining ten bits indicate the number of times minus one the words 
indicated by the four bits referred to above are repeated. 

15 [57] As may be noted, the format for compressible data used in the encoding 
according to the second embodiment of the present invention is very similar to the 
one used in the encoding according to the first embodiment of the present 
invention and differs from the latter simply in the composition of the various groups 
of bits 

20 [58] As a comparative example of the encodings according to the two 
embodiments of the present invention, FIGS 7a, 7b, 7c and 7d show the words 
used for encoding the same data sequence, the encoding of which, according to 
the first embodiment of the present invention, is illustrated in FIGS. 4a to 4e, i.e., a 
data sequence that is repeated twenty-one times and is formed by the datum "0", 

25 which is repeated seven times, followed by the datum "53", which is repeated six 
times. 

[59] As may be noted, the encoding is made up of four 16-bit words, in which the 
first word regards the encoding of the repetition of said sequence, the second word 
regards the encoding of the datum "0" which is repeated seven times, and the third 
30 word and the fourth word regard the encoding of the datum "53" which is repeated 
five times. 
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[60] In particular, in the first word (FIG. 7a), the two most significant bits are set 
at the logic values "0" and "1" to indicate that the word belongs to the encoding of a 
compressed sequence, the next four most significant bits indicate the number of 
subsequent words belonging to the encoding, in the case in point three, and the 
5 remaining ten bits indicate the number of repetitions minus one of the subsequent 
three words, in the case in point twenty. 

[61] In the second word (FIG. 7b), the two most significant bits are set at the 
logic value "0" to indicate that it is the encoding of the repetition of the datum "0", 
whilst the remaining fourteen bits indicate the number of repetitions of the datum 
10 "0", in the case in point seven. 

[62] In the third word (FIG. 7c), instead, the two most significant bits are set at 
the logic values "0" and "1" to indicate that the word belongs to a compressed 
datum, the next four bits indicate the number of subsequent words that make up, 
together with the fourth word, the encoding of repetition of the datum "53", in the 
15 case in point one word, whilst the remaining ten bits indicate the number of 
repetitions of the following word minus one, in the case in point five. 

[63] Finally, in the fourth word (FIG. 7d), the most significant bit is set at the logic 
value "1" to indicate that it is a non-compressed datum, and the remaining fifteen 
bits indicate the datum "53". 

20 [64] The described compression methods and systems according to 
embodiments of the present invention may be utilized in a variety of different types 
of electronic systems. For example, the methods may be applied to systems for 
testing memory devices and may also be applied to communications systems or 
computer networks where the compression of digital data is desireable. 

25 [65] The advantages that the described embodiments of the present invention 
afford emerge clearly from the foregoing description. 

[66] Finally, it is clear that modifications and variations can be made to what is 
described and illustrated herein, without thereby departing from the scope of the 
present invention, as defined in the annexed claims. 

30 
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